Universität Augsburg Audio Brush: Editing Audio in the Spectrogram

Author

  • R. Lienhart
Abstract

A tool for editing audio signals in the spectrogram is presented. It allows the spectrogram of a signal to be manipulated directly at any chosen time-frequency resolution and the edited signal to be reconstructed in HiFi quality, a capability that is usually not available with the Fourier or wavelet transform. Image processing and computer vision methods are applied to the spectrogram in order to identify, separate, eliminate and/or modify audio objects visually. Because spectrograms give descriptive information about the sound, the tool allows audio to be edited in a “what you see is what you hear” style. This is enabled by a thorough investigation and exploitation of Gabor analysis and synthesis. We further propose a kind of zooming, as in visual painting tools, which changes the time and frequency resolution and can be adapted to the task at hand. Results of applying the tool to erase audio objects such as whistles, music, clapping and the like from audio tracks are presented. To this end, audio objects are automatically identified as visual objects in the spectrogram and eliminated there. The cleaned signal is then reconstructed from the spectrogram in HiFi quality.
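
The edit-then-reconstruct idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes NumPy, a truncated Gaussian analysis window, and a plain weighted overlap-add synthesis, and it replaces the automatic image-based object detection with a hand-placed rectangular "brush". All function names, the synthetic signal and the parameters (fs, n_win, hop, the brush limits) are hypothetical choices made for illustration.

```python
import numpy as np

def gaussian_window(n, sigma_frac=0.15):
    """Truncated Gaussian of length n, used here as a Gabor-style window."""
    t = np.arange(n) - (n - 1) / 2
    return np.exp(-0.5 * (t / (sigma_frac * n)) ** 2)

def stft(x, win, hop):
    """Complex time-frequency coefficients, shape (frames, len(win)//2 + 1)."""
    n = len(win)
    starts = hop * np.arange(1 + (len(x) - n) // hop)
    idx = starts[:, None] + np.arange(n)[None, :]
    return np.fft.rfft(x[idx] * win, axis=1)

def istft(coeffs, win, hop, length):
    """Weighted overlap-add synthesis, normalised by the summed squared window."""
    n = len(win)
    y = np.zeros(length)
    norm = np.zeros(length)
    for k, frame in enumerate(np.fft.irfft(coeffs, n=n, axis=1)):
        y[k * hop:k * hop + n] += frame * win
        norm[k * hop:k * hop + n] += win ** 2
    covered = norm > 1e-12
    y[covered] /= norm[covered]
    return y

def rms(v):
    return np.sqrt(np.mean(v ** 2))

# Synthetic signal (assumed): a 440 Hz tone to keep plus a 2 kHz "whistle" to erase.
fs, n_win, hop = 8000, 256, 64
t = np.arange(8192) / fs
tone = np.sin(2 * np.pi * 440 * t)
envelope = np.zeros_like(t)
active = (t >= 0.3) & (t < 0.7)
envelope[active] = np.hanning(active.sum())   # smooth fade in and out
noisy = tone + 0.5 * envelope * np.sin(2 * np.pi * 2000 * t)

win = gaussian_window(n_win)
coeffs = stft(noisy, win, hop)

# "Brush" edit: zero a rectangle in the time-frequency plane around the whistle.
freqs = np.fft.rfftfreq(n_win, d=1 / fs)
frame_times = (hop * np.arange(coeffs.shape[0]) + n_win / 2) / fs
time_sel = (frame_times > 0.25) & (frame_times < 0.75)
freq_sel = (freqs > 1800) & (freqs < 2200)
brush = time_sel[:, None] & freq_sel[None, :]
cleaned = istft(np.where(brush, 0, coeffs), win, hop, len(noisy))

# Compare away from the signal edges, where only one frame covers each sample.
inner = slice(n_win, -n_win)
err_before = rms((noisy - tone)[inner])
err_after = rms((cleaned - tone)[inner])
print(f"RMS error vs. clean tone: before {err_before:.4f}, after {err_after:.4f}")
```

In this sketch, changing n_win (and hop with it) trades time resolution against frequency resolution, which is loosely analogous to the zooming described above: a longer window separates nearby frequencies better, while a shorter one localises short events more sharply.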

Related resources

Universität Augsburg Audio Brush: Smart Audio Editing in the Spectrogram

Starting with a novel audio analysis and editing paradigm, a set of new and adaptive audio analysis and editing algorithms in the spectrogram are developed and integrated into a smart visual audio editing tool in a “what you see is what you hear” style. At the core of our algorithms and methods is a very flexible audio spectrogram that goes beyond FFT and Wavelets and supports manipulating a si...

Cipher text only attack on speech time scrambling systems using correction of audio spectrogram

Recently, permutation multimedia ciphers were broken in a chosen-plaintext scenario. That attack assumes a very resourceful adversary, which may not always be the case. To show the insecurity of these ciphers, we present a ciphertext-only attack on speech permutation ciphers. We show that the inherent redundancies of speech can pave the way for a successful ciphertext-only attack. To that end, regularities ...

Visual Audio: An Interactive Tool for Analyzing and Editing of Audio in the Spectrogram

We present a tool for analyzing and editing audio signals in the visual domain. As the visual representation we use spectrograms, which give descriptive information about the sound. This allows analyzing and editing audio in a "what you see is what you hear" style. Gabor analysis and synthesis serve as the basis for creating images and recreating audio signals from the edited images in hi-fi quality. A...

Mid-level Features for Audio Chord Estimation using Stacked Denoising Autoencoders

Deep neural networks composed of several pre-trained layers have been successfully applied to various tasks related to audio processing. Stacked denoising autoencoders represent one type of such networks. This paper discusses their application to audio feature extraction for the audio chord estimation task. The features obtained from the audio spectrogram with the help of autoencoders can be u...

Spectrogram Factorization Using Phase Information

Spectrogram factorization methods have been proposed for single channel source separation [1–4], audio analysis [5–8] and more recently multichannel source separation [9–11]. All spectrogram factorization approaches incorrectly assume that the mixture spectrogram is the sum of the source spectrograms. In fact, the mixture spectrogram depends on the source spectrograms and the phase difference b...

Publication date: 2005